On partial least squares dimension reduction for microarray-based classification: a simulation study
نویسندگان
چکیده
In microarray tumor tissue classi'cation studies, the expressions of thousands of genes (variables) are simultaneously measured across a few tissue samples. Standard statistical methodologies in classi'cation do not work well when the dimension, p, is greater than the sample size, N . One approach to classi'cation problems, when p N , is to 'rst apply a dimension reduction method and then perform the classi'cation in the reduced space. In this paper, we study dimension reduction for classi'cation in high dimension based on partial least squares (PLS) and principal components analysis (PCA). In addition, we propose and explore two hybrid-PLS methods for dimension reduction. PLS components are linear combinations of the original predictors, but the weights are nonlinear functions of both the predictors and response variable. This makes it di:cult to study the PLS classi'cation methodologies analytically, so, in this paper, we turn to a numerical study using simulation. c © 2003 Elsevier B.V. All rights reserved.
منابع مشابه
Orthogonal Projection Weights in Dimension Reduction based on Partial Least Squares
Dimension reduction is important during the analysis of gene expression microarray data, because the high dimensionality in the data set hurts the generalization performance of classifiers. Partial least squares based dimension reduction (PLSDR) is a frequently used method, since it is specialized in handling high dimensional data set and leads to satisfying classification performance. However,...
متن کاملOn the Number of Partial Least Squares Components in Dimension Reduction for Tumor Classification
Dimension reduction is important during the analysis of gene expression microarray data, because the high dimensionality of data sets hurts the generalization performance of classifiers. Partial Least Squares (PLS) based dimension reduction is a frequently used method, since it is specialized in handling high dimensional data set and leads to satisfying classification performance. This paper in...
متن کاملMicroarray-based classification and clinical predictors: on combined classifiers and additional predictive value
MOTIVATION In the context of clinical bioinformatics methods are needed for assessing the additional predictive value of microarray data compared to simple clinical parameters alone. Such methods should also provide an optimal prediction rule making use of all potentialities of both types of data: they should ideally be able to catch subtypes which are not identified by clinical parameters alon...
متن کاملA note on between-group PCA
In the context of binary classification with continuous predictors, we proove two properties concerning the connections between Partial Least Squares (PLS) dimension reduction and between-group PCA, and between linear discriminant analysis and between-group PCA. Such methods are of great interest for the analysis of high-dimensional data with continuous predictors, such as microarray gene expre...
متن کاملPLS dimension reduction for classification with microarray data.
Partial Least Squares (PLS) dimension reduction is known to give good prediction accuracy in the context of classification with high-dimensional microarray data. In this paper, the classification procedure consisting of PLS dimension reduction and linear discriminant analysis on the new components is compared with some of the best state-of-the-art classification methods. Moreover, a boosting al...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Computational Statistics & Data Analysis
دوره 46 شماره
صفحات -
تاریخ انتشار 2004